Privacy server for protecting personally identifiable information

ABSTRACT

A privacy server protects private information by substituting a token or an identifier for the private information. The privacy server recognizes that a communication includes private information and intercepts the communication. The privacy server replaces the private information with a random or pseudo-random token or identifier. The privacy server maintains the private information in a local database and associates the private information for a particular person with the token or identifier for that person.

CROSS-REFERENCES TO RELATED APPLICATIONS

This application is a continuation application of U.S. Non-Provisional patent application Ser. No. 14/614,279 filed on Feb. 4, 2015, which is a continuation application of and claims priority to U.S. Non-Provisional patent application Ser. No. 13/655,507, filed on Oct. 19, 2012. Each of these applications is hereby incorporated by reference in its entirety for all purposes.

FIELD OF THE INVENTION

The invention generally relates to the protection of personally identifiable information and more particularly to the substitution of a token or identifier for the personally identifiable information by a privacy server.

BACKGROUND OF THE INVENTION

The protection of personally identifiable information (PII) is of concern as more and more information is stored and shared electronically. There are a number of laws that govern how PII can be used and how it must be protected. For example, some Canadian provinces have enacted laws to address how private electronic information, such as PII, collected by public Canadian institutions can be handled. These laws require that private data not cross provincial or Canadian borders, or be accessed by citizens or authorities of other countries. These types of laws may limit the ability of provincial residents to use applications or services that are hosted outside of the province if the application requests PII. It is not uncommon for the server that hosts an application or service to be located in a different jurisdiction from the user of the application or service. For example, a provider of a learning management system (LMS) may host the software for the LMS on a server in one jurisdiction, but serve students in a number of other jurisdictions. However, the LMS may not be able to serve students of an institution located in a jurisdiction that prohibits the transmission of the student's PII outside the jurisdiction when the LMS is located outside the jurisdiction.

To accommodate laws that prohibit the transmission of PII outside the jurisdiction or otherwise require special handling of PII, a provider can choose to host its application or service within that jurisdiction or to provide specific applications or services to address the special handling required by that jurisdiction. However, these approaches increase cost and complexity for the provider. Alternatively, a user or institution can choose to limit themselves to only those applications and services that are hosted within their jurisdiction or that provide the special handling of PII, but in doing so may deprive themselves of access to the best available resources.

Since users want access to the best available applications and solutions regardless of where they may be hosted, there is a need for a solution that protects PII without requiring separate hosting or special handling for different jurisdictions.

BRIEF SUMMARY OF THE INVENTION

Aspects of the invention relate to a privacy server and methods of operating the privacy server to protect private information. A privacy server interfaces with a user's computer system and another server, such as an application server. The privacy server protects private information received from the user's computer system by sending a token or identifier to the application server instead of the private information.

The privacy server recognizes when the user is communicating private information to the application and intercepts the communication. The privacy server replaces the private information with a random or pseudo-random token or identifier. The privacy server maintains the private information in a local database and associates the private information for a particular person with a token or identifier for that person. Communications from the application to the user also pass through the privacy server. If a communication includes the token or identifier, then the privacy server intercepts the communication and replaces the token with the user's private information prior to presenting the communication to the user.

Other features, advantages, and objects of the present invention will be apparent to those skilled in the art with reference to the remaining text and drawings of this application.

BRIEF DESCRIPTION OF THE DRAWINGS

These and other features, aspects, and advantages of the present disclosure are better understood when the following Detailed Description is read with reference to the accompanying drawings, where:

FIG. 1 is a block diagram illustrating an exemplary privacy server.

FIG. 2 is a block diagram illustrating an exemplary registration process using a privacy server.

FIG. 3 is a block diagram illustrating another exemplary registration process using a privacy server.

FIG. 4 is a block diagram illustrating yet another exemplary registration process using a privacy server.

FIG. 5 is a flow diagram illustrating an exemplary process for creating a token.

FIG. 6 is a block diagram illustrating an exemplary web page process using a privacy server.

FIG. 7 is a block diagram illustrating another exemplary web page process using a privacy server.

FIG. 8 is a block diagram illustrating an exemplary e-mail process using a privacy server.

FIG. 9 is a block diagram illustrating another exemplary e-mail process using a privacy server.

DETAILED DESCRIPTION OF THE INVENTION

Aspects of the present invention are directed to a privacy server that maintains private information at the privacy server while using applications or services hosted on other servers. In some instances, the privacy server can be used to maintain private data within one jurisdiction even though a user is accessing an application hosted on a server located outside the jurisdiction. The privacy server intercepts communications between a user and an application that include private information. The privacy server creates a token or identifier that does not disclose the private information and uses the token or identifier instead of the private information when communicating with other servers and systems. In some instances, the operation of the privacy server in substituting a token or identifier for private information may be transparent to both the user and to the other servers and systems.

The scope of private information may vary, but generally includes any information unique to an individual, such as name, home address, opinions, educational records, age, gender, income, medical records, and/or financial data. The terms private information and personally identifiable information (PII) are used interchangeably herein. Information that is not private, i.e., not identified or linked to an individual is referred to herein as anonymous.

Exemplary Operating Environment

FIG. 1 illustrates an exemplary relationship between the user's computer system 102, the privacy server 104, and the application server 106. Both the user's computer system 102 and the privacy server 104 are located in the same jurisdiction. The application server 106 is located in a different jurisdiction. FIG. 1 illustrates that the user's computer system 102 and the privacy server 104 are located in Canada and that the application server 106 is located in the United States. Although FIG. 1 illustrates that the jurisdictions are based on country boundaries, other types of jurisdictional boundaries may be used. For example, if a state or a province has privacy requirements that are more stringent than the applicable national privacy requirements, then the user's computer system and the privacy server may be in one state or province and the application server may be in a different state or province, but all may be located in the same country.

Although not shown in FIG. 1, the application server can be connected to additional privacy servers and/or connected directly to additional user systems. The additional privacy servers and/or user systems may be located in the same or additional jurisdictions. The privacy server may be configured to allow it to interact with an application or application server without the application or application server recognizing that it is interacting with the privacy server.

A user may access an application hosted on the application server, such as a cloud-based application, i.e., an application resident on the application server and accessible via a network, such as the Internet. The user's communications with the application pass through the privacy server. The privacy server recognizes when the user is communicating PII to the application and intercepts the communication. The privacy server replaces the PII with a random or pseudo-random token or identifier. The privacy server maintains the PII in the local PII database 105 and associates the PII for a particular user with an identifier for that user. An identifier, such as a PII identifier, maybe a random or pseudo-random string. The privacy server may decorate the PII identifier to create a token. The application hosted by the application server receives the token from the privacy server and uses it to identify a user. In some instances, the application does not distinguish between a received token and received PII, which may allow an existing application to work with users that access the application via a privacy server, as well as users that access the application directly.

The application may maintain the token in the database associated with the application server, illustrated by the LMS database in FIG. 1. If the application is capable of receiving both tokens and PII, then the tokens are maintained in a manner similar to that used to maintain PII. The LMS database is distinct from the PII database so that the application does not receive or use the user's PII.

Any communication from the application to the user also passes through the privacy server. If the communication includes a token, then the privacy server intercepts the communication and replaces the token with the user's PII prior to presenting the communication to the user. In this manner, the existence and operation of the privacy server is transparent to the user.

Although FIG. 1 illustrates one possible configuration, the features discussed herein are not limited to any particular hardware architecture or configuration. The user's computer system, the privacy server and the application server may include a computing device, as well as a non-transitory computer-readable medium capable of storing code and may be capable of the operations described herein. One example of a computing device is a multipurpose computer system capable of executing software or other code. Examples of non-transitory computer-readable medium include electronic, optical, magnetic, or other storage device capable of storing computer-readable instructions. Other examples include, but are not limited to, a floppy disk, CD-ROM, DVD, magnetic disk, memory chip, ROM, RAM, an ASIC, or any other medium from which a computer processor can read instructions. The user's computer system, the privacy server, and the application server may communicate via any type of a network including, but not limited to, a wired or wireless network.

Exemplary Operation

FIG. 1 illustrates that the privacy server may include modules to support a PII registration process 112, as well as a PII proxy process 110 and an e-mail relay process 114. The operation of the privacy server will now be described with reference to a registration process where the privacy server generates and uses a token. A non-limiting example of an educational application, such as a Learning Management System (LMS), that is capable of using student PII is used for illustration.

In this example, a teacher registers a student by entering the student's information, including PII, via a system, such as the user's computer system 102 of FIG. 2. The teacher may enter the information by uploading a spreadsheet that includes information for one or more students or may enter the information via a registration form or page. The privacy server recognizes that student information is being entered as part of the registration process and intercepts the information. In some instances, the privacy server recognizes that a spreadsheet has been uploaded that is associated with PII. In other instances, the privacy server recognizes that the teacher is accessing a registration form or page. In one exemplary implementation, the privacy server is programmed to recognize that a particular form or web page is being loaded and intercepts the PII entered into the form or web page prior to communicating it to the application server.

The teacher is unaware that the privacy server is intercepting any student information. The teacher interacts with the application in the same manner as the teacher would if there was no privacy server. As will be apparent to those skilled in the art, there are other ways that the privacy server may recognize PII, including, but not limited to being configured to recognize certain actions or sequences associated with a user's interaction with the application or to recognize certain types of information.

Once the privacy server intercepts the PII, the privacy server saves the PII locally and generates a PII identifier. In this example, the PII includes the student's name, John Smith, and the student's e-mail address, “john smith@myschool.edu.ca”. The student's name and e-mail address are saved in the PII database and are associated with the PII identifier, which in this example is “12345”.

The PII identifier is a random or pseudo-random character string. The character string can be an alphabetic character string, a numeric character string, or an alpha-numeric character string. The PII is not used to generate the PII identifier. Instead other types of information, including, but not limited to the time of day or a portion of the network address of the user's computer system may be used to generate the PII identifier. The PII identifier uniquely identifies the student or other entity within the scope of the privacy server. The PII identifier may be generated by a computer-implemented method provided by the privacy server. One example of a PII identifier is a GUID or globally unique identifier. In some instances the PII identifier includes designated characters that can be used for sorting. For example, a PII identifier may include characters that represent the first few letters of the student's sur name to support sorting alphabetically by sur name. In this instance the PII identifier is not completely random, but still protects the student's PII.

As shown in FIG. 2, the privacy server 102 may use attributes or attribute codes to identify different types of PII. For example, the student's full name may be associated with an attribute for full name and may be identified by an attribute code of “FN”, the student's sur name may be may be associated with an attribute for sur name and identified by an attribute code of “SN”, and the student's given name may be associated with an attribute for given name and identified by an attribute code of “GN”. Attributes and attribute codes may be helpful when the PII identifier represents more than one type of information. The manner in which the student's PII is partitioned and the types of attributes and attribute codes are typically based upon the requirements of the application. If the application uses a student's given name, sur name, and full name, then the attributes and attribute codes may be set up as illustrated in FIG. 2.

FIG. 2 illustrates that the privacy server may create a token by decorating the PII identifier. The privacy server may decorate the token by adding a start code and/or an end code to the beginning and/or ending of the PII identifier to indicate the beginning and/or ending of the token. FIG. 2 illustrates a start code of “#@” and an end code of “@#”. However, any suitable start or end code may be used. In the alternative or in addition, the privacy server may decorate the token by adding an attribute code to the PII identifier to identify the type of PII. For example, FIG. 2 illustrates that the privacy server may add “FN” to indicate the student's full name, “SN” to indicate the student's sur name, and/or “GN” to indicate the student's given name.

The privacy server sends the token to the application server 106 instead of the student's PII. For example, if the teacher uploaded a spreadsheet with a student's full name, then the privacy server replaces the student's full name with a token and sends the spreadsheet with the token to the application server. In this example, the token may be “#@FN:12345@#”. Similarly, if the teacher enters student PII into a registration form, then the privacy server replaces the student's full name with a token before sending the registration form to the application server. Only the token is sent to the application server. The privacy server does not send the student's PII to the application server.

The application receives the spreadsheet or registration form from the privacy server and registers the student with the application by storing the token in the LMS database 107. The application treats the token as a student identifier. The token is maintained in the LMS database so that the student's performance and progress can be tracked. In some instances the application, receives tokens for some students and PII for other students. If so, then the application treats the tokens in the same manner that it treats student PII. One benefit of the privacy server may be that the application does not need to be changed to protect PII since the protection is provided by the privacy server.

In other instances, the application server only stores the PII identifier, not each token. One example of this is shown in FIG. 3 where the application server includes a data layer 302. The data layer receives a token, removes the decoration, i.e., removes any start/end codes and any attribute codes, and passes only the PII identifier to the database 107 for storage. The data layer also passes the e-mail domain of the privacy server to the database so that it is associated with the PII identifier. Prior to the application server communicating with the privacy server, the data layer may decorate the PII identifier so that the appropriate token is sent to the privacy server.

In yet another instance, the privacy server does not decorate the PII identifier. As illustrated in FIG. 4, the privacy server creates a PII identifier, “12345”, for a student and sends the PII identifier and the e-mail domain of the privacy server, “privacy server.com.ca”, to the application server. The data layer stores the PII identifier and the e-mail domain in the local database. The data layer uses the PII identifier and the e-mail domain to create an e-mail address when sending an e-mail to the student. In the examples illustrated by FIGS. 3 and 4, the application is aware that it is receiving a PII identifier from the privacy server and the data layer is capable of decorating the PII identifier or assembling an e-mail address using the PII identifier.

In some instances, the user may provide information or data to the application that does not need to be protected, such as anonymous information. If so, the privacy server allows the anonymous information or data to pass unaltered to the application server. If PII is provided along with anonymous information that does not need privacy protection, then the privacy server only substitutes a token or PII identifier for the PII and allows the anonymous information to pass unaltered to the application server. For example, if a registration form requests a student's name in one information field and a class name in another information field, then the privacy server may replace the student's name with a token, but allow the class name to pass through to the application server.

When the privacy server receives a communication from the application server that includes a token, the privacy server uses the token to locate the PII that corresponds to the token stored in the PII database and substitutes the appropriate PII. For example, if the teacher requests a report for a class of students, then the teacher may provide a class identifier, such as a class name or course number to the application. The application generates a report that includes the tokens for the names of the students in the class. The privacy server intercepts the report and substitutes the students' names for the tokens prior to providing the report to the teacher.

In some implementations, the privacy server determines that a communication includes a token by scanning the communication for token delimiters, such as a start and/or end code. In other implementations, the privacy server may be designed to scan particular document types, particular documents or particular web pages for tokens.

An exemplary method for generating a token is illustrated by FIG. 5. The method starts when the privacy server receives a communication from the user's computer system in 502. The privacy server determines whether the communication includes PII in 504. For example, the privacy server may determine whether the communication includes a field associated with PII. If the communication does not include PII, then the method proceeds to 506 and the communication is forwarded without any modification. If the determination in 504 is that the communication includes PII, then the method proceeds to step 508 and the privacy server intercepts the communication. The privacy server extracts the PII in 510 and creates a PII identifier in 512. The privacy server associates the PII identifier with the PII in 514 and stores the PII identifier and the PII in a local database in 516. The privacy server creates a token by decorating the PII identifier in 517. Alternatively, the privacy server may use the PII identifier instead of a token. The privacy server substitutes the token for the PII in 518 and then forwards the communication with the token in 520 to the application server.

The privacy server may also serve as a proxy between a user and the application, as well as other applications or services since all communications from the user pass through the privacy server. This function is similar to that currently used to filter web traffic. When a user requests a web page, the web page request passes through the privacy server. If the web page request includes PII, then the privacy server replaces with PII with a token or PII identifier before the request is forwarded to the appropriate server.

Regardless of whether the web page request includes PII, the web page returned from the application server may include a token or PII identifier. The local database for the application server stores a token or a PII identifier instead of PII. If the web page includes an information field associated with PII, such as a name field, then the application server inserts the token or PII identifier into the field. The token or PII identifier can be retrieved from the local data base or the data layer can retrieve a PII identifier from the local data base and decorate it. When the privacy server receives the web page, it replaces the token or PII identifier with the appropriate PII prior to providing the web page to the user. In this manner the user receives a web page which includes the user's PII, even though the PII was never provided to the application server.

FIG. 6 illustrates an exemplary web page request. The user sends a communication to the application server requesting the web page. The communication does not includes any PII so the communication is forwarded to the application server without any modifications. The application server determines that the web page is to include the full name of a student known to the application as “12345”. The application server accesses the LMS database to retrieve the full name of student “12345”. In this case, the LMS database includes a token for the full name of the student “#@FN:12345@#”. The application server includes the token in the field of the web page associated with the full name of the student. When the privacy server receives the web page, the privacy server recognizes that the web page contains a token and replaces the token with the appropriate PII. In this case the privacy server replaces the token “#@FN:12345@#” with the student's full name “John Smith” prior to presenting the web page to the user.

FIG. 6 illustrates that the LMS database stores a token. In other instances, the LMS database may store the PII identifier and the data layer may decorate the PII identifier. For example, FIG. 7 illustrates the situation where the application server determines that the web page is to include the full name of a student known to the application as “12345”. The data layer checks the LMS database to determine whether the database includes information for student “12345”. If student “12345” exists in the LMS database, then the data layer decorates “12345” prior to sending the web page to the privacy server. In the example illustrated by FIG. 7, the data layer decorates “12345” with start and end codes, as well as an attribute code for the student's full name.

The privacy server may also act as an e-mail relay since it can substitute an e-mail address the uses a PII identifier and the privacy server's e-mail domain for the user's e-mail address or forward an e-mail to the user's e-mail address that was received at the privacy server. FIG. 8 illustrates an example where the application initiates communication with a student or other user by sending an e-mail. Since the student is only identified to the application by a PII identifier, the application looks up the e-mail address for the student using the PII identifier. The application determines that the e-mail address for the student identified by PII identifier “12345” is “12345@privacy_server.com.ca”. The application sends an e-mail to the student using that address. When the e-mail is received at the privacy server, the privacy server recognizes that the e-mail address includes the privacy server's e-mail domain. The privacy server uses the PII identifier to determine the corresponding e-mail address for the student and then substitutes the student's e-mail address for the received e-mail address. In the example illustrated by FIG. 8, the privacy server substitutes “john smith@myschool.edu.ca” for “12345@privacy_server.com.ca” before sending the e-mail on to the student. The student receives an e-mail addressed to the student's e-mail address with content created by the application even though the application did not know the student's e-mail address.

FIG. 9 illustrates another example where the application initiates communication with a student or other user by sending an e-mail. Since the student is only identified by a PII identifier, the data layer looks up the e-mail address for the student using the PII identifier. The data layer determines that the student identified by PII identifier “12345” is associated with a privacy server that has an e-mail domain of privacy_server.com.ca. The data layer then addresses the e-mail to “12345@privacy_server.com.ca”. When the e-mail is received at the privacy server, the privacy server uses the PII identifier to determine the corresponding e-mail address for the student and then substitutes the student's e-mail address for the received e-mail address. In the example illustrated by FIG. 9, the privacy server substitutes “john_smith@myschool.edu.ca” for “12345@privacy_server.com.ca” before sending the e-mail on to the student. The student receives an e-mail addressed to the student's e-mail address, but created by the application even though the application did not know the student's e-mail address.

The foregoing description of exemplary embodiments of the invention has been presented only for the purposes of illustration and description and is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to explain the principles of the invention and their practical application to enable others skilled in the art to utilize the invention and various embodiments with various modifications as are suited to the particular use contemplated. Alternative embodiments will become apparent to those skilled in the art to which the present invention pertains without departing from its spirit and scope. For example, although the examples describe that the user is a teacher entering information about students, the user could be entering information about itself. In addition, the invention is not limited to a LMS or other educational application, but can be used with any system where privacy or protection of PII is a concern. 

What is claimed is:
 1. A non-transitory computer-readable medium capable of storing code for performing operations including: receiving a communication from a first computing device located in a first jurisdiction, wherein: the first jurisdiction includes geography that is not included in a second jurisdiction, the second jurisdiction includes other geography that is not included in the first jurisdiction, and content of the communication includes a name and identifier of a user associated with the first jurisdiction or the second jurisdiction; analyzing the communication to determine whether the communication is intended to be transmitted to the second jurisdiction; determining that the content of the communication includes personally identifiable information associated with laws that restrict transmissions of the personally identifiable information; and when the content of the communication includes the personally identifiable information associated with the laws that restrict transmissions of the personally identifiable information: creating a token to replace the personally identifiable information in the content of the communication, substituting the personally identifiable information in the content of the communication with the token, and transmitting the communication with the token instead of the personally identifiable information to a second computing device associated with the second jurisdiction.
 2. The non-transitory computer-readable medium of claim 1, wherein the operations further include: storing the token in a local database.
 3. The non-transitory computer-readable medium of claim 1, wherein the token is a pseudo-random token.
 4. The non-transitory computer-readable medium of claim 1, wherein the token is substituted for a full name of the user.
 5. The non-transitory computer-readable medium of claim 1, wherein the name of the user is substituted with the token from the communication and a class name or course number is not substituted with the token from the communication.
 6. The non-transitory computer-readable medium of claim 1, wherein the operations further include: intercepting the communication.
 7. The non-transitory computer-readable medium of claim 1, wherein the token includes a start code and an end code.
 8. A method comprising: receiving the communication from a first computing device located in a first jurisdiction, wherein: the first jurisdiction includes geography that is not included in a second jurisdiction, the second jurisdiction includes other geography that is not included in the first jurisdiction, and content of the communication includes a name and identifier of a user associated with the first jurisdiction or the second jurisdiction; analyzing the communication to determine whether the communication is intended to be transmitted to the second jurisdiction; determining that the content of the communication includes personally identifiable information associated with laws that restrict transmissions of the personally identifiable information; and when the content of the communication includes the personally identifiable information associated with the laws that restrict transmissions of the personally identifiable information: creating a token to replace the personally identifiable information in the content of the communication, substituting the personally identifiable information in the content of the communication with the token, and transmitting the communication with the token instead of the personally identifiable information to a second computing device associated with the second jurisdiction.
 9. The method of claim 8, further comprising: storing the token in a local database.
 10. The method of claim 8, wherein the token is a pseudo-random token.
 11. The method of claim 8, wherein the token is substituted for a full name of the user.
 12. The method of claim 8, wherein the name of the user is substituted with the token from the communication and a class name or course number is not substituted with the token from the communication.
 13. The method of claim 8, further comprising: intercepting the communication.
 14. The method of claim 8, wherein the token includes a start code and an end code.
 15. A privacy server comprising: a processor; and a memory device including instructions that, when executed by the processor, cause the processor to: receive a communication from a first computing device located in a first jurisdiction, wherein: the first jurisdiction includes geography that is not included in a second jurisdiction, the second jurisdiction includes other geography that is not included in the first jurisdiction, and content of the communication includes a name and identifier of a user associated with the first jurisdiction or the second jurisdiction; analyze the communication to determine whether the communication is intended to be transmitted to the second jurisdiction; determine that the content of the communication includes personally identifiable information associated with laws that restrict transmissions of the personally identifiable information; and when the content of the communication includes the personally identifiable information associated with the laws that restrict transmissions of the personally identifiable information: create a token to replace the personally identifiable information in the content of the communication, substitute the personally identifiable information in the content of the communication with the token, and transmit the communication with the token instead of the personally identifiable information to a second computing device associated with the second jurisdiction.
 16. The privacy server of claim 15, wherein the processor is further caused to: store the token in a local database.
 17. The privacy server of claim 15, wherein the token is a pseudo-random token.
 18. The privacy server of claim 15, wherein the token is substituted for a full name of the user.
 19. The privacy server of claim 15, wherein the name of the user is substituted with the token from the communication and a class name or course number is not substituted with the token from the communication.
 20. The privacy server of claim 15, wherein the processor is further caused to: intercept the communication. 